Rank in Wordlist | Frequency | Word |
---|---|---|
1685 | 9 | 1,5 |
2176 | 7 | 2,5 |
2941 | 5 | 1,4 |
3657 | 4 | 0,5 |
3658 | 4 | 1,6 |
4825 | 3 | 2,2 |
4837 | 3 | 3,5 |
4840 | 3 | 4,5 |
6424 | 3 | see,et |
6955 | 2 | 1,2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
12765 | 1 | 17(A),18,23,35,36 |
13453 | 1 | 5800(telefon |
15540 | 1 | Intercooler(jahutab |
16717 | 1 | Liidu(ETLL |
17140 | 1 | Mees(tm |
18397 | 1 | Rail(way)s |
19558 | 1 | Timsenko(või |
19568 | 1 | Toakaaslas(t)ega |
21257 | 1 | arvamuse(d |
22601 | 1 | elektrijaama(de |
Rank in Wordlist | Frequency | Word |
---|---|---|
12765 | 1 | 17(A),18,23,35,36 |
18397 | 1 | Rail(way)s |
19568 | 1 | Toakaaslas(t)ega |
20469 | 1 | Zelazko). |
21234 | 1 | arust),kust |
24000 | 1 | hommikul),ilmselt |
24106 | 1 | hulgi)müügi |
24897 | 1 | jne),» |
26211 | 1 | keele-)filosoofia |
26404 | 1 | keskkonna)põhimõtetele |
Rank in Wordlist | Frequency | Word |
---|---|---|
1427 | 11 | 100% |
2170 | 7 | 10% |
2184 | 7 | 50% |
2944 | 5 | 20% |
2947 | 5 | 30% |
2952 | 5 | 5% |
2956 | 5 | 80% |
3681 | 4 | 90% |
4816 | 3 | 1% |
4834 | 3 | 25% |
Rank in Wordlist | Frequency | Word |
---|---|---|
7849 | 2 | S&D |
13928 | 1 | Anne&Stiil |
17767 | 1 | P&P |
Rank in Wordlist | Frequency | Word |
---|---|---|
7318 | 2 | GoBus'i |
7614 | 2 | MSTS'i |
7615 | 2 | MSTS'ist |
12562 | 1 | 10't |
12789 | 1 | 18'000 |
12935 | 1 | 2'e |
13275 | 1 | 350Z'i |
13589 | 1 | 80's |
13639 | 1 | 9'l |
13832 | 1 | Aladdin's |
Rank in Wordlist | Frequency | Word |
---|---|---|
2001 | 8 | km/h |
3173 | 5 | ja/või |
6964 | 2 | 1049/2001 |
6982 | 2 | 160km/h |
6989 | 2 | 185/65 |
7025 | 2 | 2000/78 |
7058 | 2 | 30km/h |
12558 | 1 | 1/16-finaalis |
12559 | 1 | 1/2 |
12560 | 1 | 1/4000 |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, this may indicate a problem in preprocessing.
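The check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `flag_suspicious`, the `allowed` set, and the threshold `min_freq=5` are hypothetical and not taken from the source; only the character list and the sample words and frequencies come from this section.

```python
# Special characters examined in this subsection (from the text above).
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(words, allowed=frozenset(), min_freq=5):
    """Return (word, freq, offending_chars) for every word that contains a
    special character not in `allowed` and occurs at least `min_freq` times.

    `allowed` and `min_freq` are assumptions; both depend on the language
    and on the corpus size.
    """
    flagged = []
    for word, freq in words:
        offending = sorted((set(word) & SPECIAL_CHARS) - set(allowed))
        if offending and freq >= min_freq:
            flagged.append((word, freq, "".join(offending)))
    return flagged


# A few (word, frequency) pairs copied from the tables in this section.
sample = [
    ("1,5", 9),
    ("100%", 11),
    ("km/h", 8),
    ("S&D", 2),
    ("Zelazko).", 1),
    ("GoBus'i", 2),
]

# Assumption for illustration: "%" and "/" are acceptable inside words.
flagged = flag_suspicious(sample, allowed={"%", "/"}, min_freq=5)
print(flagged)
```

With these assumed settings only the decimal number "1,5" is reported. Whether such a word counts as a preprocessing problem still has to be judged per language.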
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
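The same query can be run for any of the listed characters. The following Python sketch uses an in-memory SQLite database as a stand-in. It assumes a table `words(w_id, freq, word)` as implied by the query; the helper name `words_containing` and the sample rows are hypothetical. Unlike the original query, it needs an explicit `ESCAPE` clause (SQLite has no default escape character for `LIKE`), and it adds an `ORDER BY` so that the result order is deterministic.

```python
import sqlite3


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words containing `char`.

    Mirrors the query above: rank is w_id - 100 and rows with
    w_id <= 100 are skipped.
    """
    # "%" and "_" are LIKE wildcards, so escape them (and the escape
    # character itself) before building the pattern.
    escaped = (
        char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    )
    pattern = f"%{escaped}%"
    return conn.execute(
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?",
        (pattern, limit),
    ).fetchall()


# Hypothetical miniature word table; w_id values are rank + 100.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (50, 100, "%"),      # w_id <= 100, excluded by the query
        (1527, 11, "100%"),
        (1785, 9, "1,5"),
        (2101, 8, "km/h"),
        (2270, 7, "10%"),
    ],
)

percent_rows = words_containing(conn, "%")
slash_rows = words_containing(conn, "/")
underscore_rows = words_containing(conn, "_")
print(percent_rows)
```

Passing the pattern as a bound parameter avoids quoting problems for characters such as " and ', which would otherwise have to be escaped inside the SQL string itself.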